Linked Data Triples Enhance Document Relevance Classification

نویسندگان

چکیده

Standardized approaches to relevance classification in information retrieval use generative statistical models identify the presence or absence of certain topics that might make a document relevant searcher. These have been used better predict on basis what is “about”, rather than simple-minded analysis bag words contained within document. In more recent times, this idea has extended by using pre-trained deep learning and text representations, such as GloVe BERT. an external corpus knowledge-base conditions model help about. This paper adopts hybrid approach leverages structure knowledge embedded corpus. particular, reports experiments where linked data triples (subject-predicate-object), constructed from natural language elements are derived learning. evaluated additional latent semantic features for classifier customized news-feed website. The research synthesis current thinking NLP predicate web research. Our indicate increased F-score baseline representations 6% show significant improvement over state-of-the art models, like findings tested empirically validated experimental dataset two standardized pre-classified news sources, namely Reuters 20 News groups datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pairwise Document Classification for Relevance Feedback

In this paper we present Carnegie Mellon University’s submission to the TREC 2009 Relevance Feedback Track. In this submission we take a classification approach on document pairs to using relevance feedback information. We explore using textual and non-textual document-pair features to classify unjudged documents as relevant or non-relevant, and use this prediction to re-rank a baseline documen...

متن کامل

Linked Data Annotated Document Retrieval

Search engines traditionally suffer drawbacks from ambiguities of natural language, which users often solve via query refinement. In contrast to web search, querying document collections of limited size (e.g. blogs, multimedia collections, or libraries) can quickly lead to empty result sets because the wrong choice of keywords may eliminate the only relevant document. Besides query expansion [1...

متن کامل

Classification DOCUMENT CONTROL DATA

Field trials of the LTS-3 system at Keesler Air Force Base have been extended, and excellent results have been obtained with high-aptitude students, who had been excluded from earlier trials. A study of the use of the LTS for task simulation has led to the implementation of a new student response interpretation feature for the system. Design of the microfiche selector/reader breadboard for LTS-...

متن کامل

BetterRelations: Using a Game to Rate Linked Data Triples

While associations between concepts in our memory have different strengths, explicit strengths of links (edge weights) are missing in Linked Data. In order to build a collection of such edge weights, we created a web-game prototype that ranks triples by importance. In this paper we briefly describe the game, Linked Data preprocessing aspects, and the promising results of an evaluation of the game.

متن کامل

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2021

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app11146636